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REMARKS 

Summary of Examiner Interview 

Applicant thanks the Examiner for the interview of November 3, 2004. During 
the interview, both of the remaining rejections were discussed. Applicant's 
representatives argued that the rejection of claims 2-9, as failing to comply with the 
written description requirement of 35 U.S.C. §112, was improper because the 
sequence of human antithrombin III was well known in the art at the time the 
application was filed and because human antithrombin III is described by multiple 
reference points within the specification. The Examiner agreed to discuss the 
rejection with a biotechnology specialist. After the discussion with the specialist, the 
Examiner telephoned Applicant's representative on November 22, 2004, and 
indicated that submission of references to the Office demonstrating that the 
sequence of antithrombin III was known at the time the application was filed may be 
persuasive. Applicant provides such references in the present response. 

Applicant's representatives also argued that the rejection of claims 5 and 9 
under 35 U.S.C. §103, as obvious in light of J.A. Huntington ef.a/., Biochemistry, 
1998, 37: 3272-3277, J.A. Huntington ef.a/., Biochemistry, 1996, 35: 8495-8503, and 
common knowledge in molecular biology, was improper. Applicant's representatives 
stated that one of ordinary skill in the art would not be motivated to substitute an 
amino acid that is less bulky than tryptophan into the 380 position of human 
antithrombin III based on the teachings of Huntington (1996) and Huntington (1998). 
Furthermore, because the amino acids recited in claims 5 and 9 are all less bulky 
than tryptophan, as demonstrated on the bulkiness scale provided by Applicant's 
representatives to the Examiner, the rejection of claims 5 and 9 should be 
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withdrawn. After reviewing the bulkiness scale, the Examiner stated that the 
obviousness rejection could be overcome by deleting amino acids leucine (Leu), 
isoleucine (lie), and valine (Val) from claim 5. Further, the Examiner agreed that 
substitution of the serine at position 380 with the amino acid arginine (Arg) was non- 
obvious, because Arg is substantially less bulky than tryptophan. 

Applicant has amended claim 5 accordingly. Support for this amendment can 
be found in the specification as originally filed at, for instance, original claim 5. 
Therefore, no new matter has been added by way of this amendment. 

Claims 2-9 are pending. 
Rejection under 35 U.S.C., § 112, first paragraph 

The Office maintained the rejection of claims 2-9 as failing to comply with the 
written description requirement of 35 U.S.C. §112, because the amino acid and 
nucleotide sequences of the protein to be modified are not disclosed. See Office 
Action at pages 2-3. The Office argues that the claims are directed to a large genus 
of human antithrombin variants without describing the structure of a representative 
species. Id. Applicant, however, respectfully submits that significant evidence is 
available demonstrating that the inventor had possession of the claimed invention as 
of the filing date. 

Information which is well known in the art need not be described in detail in 
the specification; there is no need to disclose what is already known. M.P.E.P. 
§2163. The amino acid sequence of human antithrombin III was well known in the 
art. Cloning and expression of the cDNA for human antithrombin III was reported as 
early as 1983 (Exhibit 2; T. Chandra et al. Proc. Natl. Acad. Sci. USA, 1983, 
80:1845-1848). As provided in the previous response of August 9, 2004, the 
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Applicant himself published the amino acid sequence in 1994 in a globally- 
distributed journal, The Journal of Biochemistry. See F. Tokunaga, T. Koide et a/., 
"Amino Acid Sequence of Porcine Antithrombin III," J. Biochem. 116: 1 164-1 170, 
1 167 (1994). The Office incorrectly states that "[t]he article by Tokunaga and co- 
workers published in J. Biochem. presents porcine and not human antithrombin III." 
Office Action at page 4. The human antithrombin sequence is disclosed in the 
article and can be found in Figure 2, page 1 167, top line. Furthermore, this article 
refers to the Chandra et al. reference (discussed above) at page 1 168, second 
paragraph. 

"The function of the description requirement is to ensure that the inventor had 
possession, as of the filing date of the application relied on, of the specific subject 
matter later claimed by him." In re Wertheim, 541 F.2d 257, 262 191 USPQ 90 
(CCPA 1976). Given that the present specification is read in light of the inventor's 
prior publication of the natural human antithrombin III sequence, there can be no 
doubt that the inventor had possession of the invention as of the application's filing 
date. 

The specification makes reference to Japanese Patent No. 262598/1990, 
corresponding to EP 0384122, which discloses the complete amino acid sequence 
of mature human antithrombin III. See Specification, page 4; European Patent No. 
0384122, Table 1 (issued January 19, 1990). A copy of EP 0384122 was provided 
with the Information Disclosure Statement filed on December 21 , 2001 . 
Furthermore, the specification of EP 0384122 at page 2, lines 6-8, makes reference 
to European Patent Application EP 0 090 505 A2 (published October 5, 1983), which 
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also discloses the nucleotide and amino acid sequence of human antithrombin III 

(Exhibit 3; Figure 2). 

Applicant has provided the Office with overwhelming evidence that the 

sequence of human antithrombin III was well known in the art at the time of the filing 

date of this application. For this reason, and because human antithrombin III is 

described by multiple reference points within the specification, ensuring that 

Applicant was in possession of the claimed invention, Applicant respectfully 

requests that the §112, first paragraph rejection be withdrawn. 

Rejection under 35 U.S.C. §103 

The Office maintained its rejection of claims 5 and 9 under 35 U.S.C. §103 as 
obvious in light of J.A. Huntington et a/., Biochemistry, 1998, 37: 3272-3277, J.A. 
Huntington etal., Biochemistry, 1996, 35: 8495-8503, and common knowledge in 
molecular biology. Specifically, the Office alleges that "a person skilled in the art 
would have a reasonable expectation of success in substituting serine in position 
380 with any amino acid that is bulkier than serine." Office Action at page 7. 
Applicant respectfully traverses and requests that the Office reconsider this rejection 
in light of the current amendments to claim 5. 

The references cited by the Office do not teach a reasonable expectation of 

success if the amino acid substituted for the naturally occurring serine is less bulky 

than tryptophan. Huntington (1996) teaches that a serine to tryptophan substitution 

produces only "a partially activated conformation." Huntington (1996), page 8502, 

first full paragraph (emphasis added). Huntington (1996) concludes that "[t]his 

raises the possibility that other changes in this critical hinge region of the reactive 

center might even more closely resemble the fully heparin-activated conformation 
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and consequently give antithrombins that inhibit factor Xa at rates approaching those 

of antithrombin-heparin complexes." Id. Thus, based on the teachings of this 

reference, one of ordinary skill in the art would be motivated to replace the naturally 

occurring serine with a more bulky amino acid than tryptophan, in order to produce a 

more activated conformation. 

Huntington (1998) examines replacement of the naturally occurring serine 
with a cysteine. The cysteine, however, is derivatized with a bulky fluorescein. The 
structure of the bulky fluorescein was provided to the Office during the interview on 
November 3, 2004. See Interview Summary at page 1 . Huntington (1 998) teaches 
that the substitution with the cysteine-fluorescein produces an antithrombin that is 
"permanently and fully activated toward reaction with factor Xa in a manner 
analogous to heparin activation" Huntington (1998), page 3272, abstract (emphasis 
added). Thus, the Huntington references together teach that only by substituting 
serine with an amino acid more bulky than tryptophan, can one obtain a fully 
activated antithrombin, without the presence of heparin. 

In determining bulkiness, Applicant had referenced Niwa and Ogino, "Multiple 
Regression Analysis of the Beta-Sheet Propensity of Amino Acids," J ofMol. Struct 
(1996)155-160, at 157. However, Applicant provided a more easily understood and 
generally accepted scale of bulkiness to the Office during the Interview of November 
3, 2004. See Interview Summary at page 1 . A copy of this scale is attached 
herewith (Exhibit 1). 

The Office stated that antithrombin variants made by substituting amino acids 
substantially less bulky than tryptophan would be non-obvious, but amino acids 
similar to tryptophan in bulkiness (Leu, He, Val, Tyr) would be expected to be similar 
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to tryptophan. See Interview Summary at page 3. Applicant respectfully submits 
that one of ordinary skill in the art would not be motivated to substitute any amino 
acid less bulky than tryptophan, as discussed above. However, merely to expedite 
prosecution, Applicant has deleted amino acids lie, Leu, and Val from claim 5. 
Applicant reserves the right to pursue these claim embodiments in a separate 
application. Applicant also added the amino acid arginine (Arg) to claim 5, which is 
substantially less bulky than tryptophan, based on the bulkiness scale provided 
herewith. In light of this claim amendment, Applicant respectfully requests the Office 
to withdraw the obviousness rejection of claims 5 and 9. 
Conclusion 

In view of the foregoing amendments and remarks, Applicant respectfully 
requests the reconsideration and reexamination of this application and the timely 
allowance of the pending claims. 

Please grant any extensions of time required to enter this response and 
charge any additional required fees to our deposit account 06-0916. 



Attachments: 

Exhibit 1 : Scale of Bulkiness 

Exhibit 2: T. Chandra etaL Proc. Natl. Acad. ScL USA, 1983, 80:1845-1848 
Exhibit 3: European Patent Application EP 0 090 505 A2 



Respectfully submitted, 



FINNEGAN, HENDERSON, FARABOW, 
GARRETT & DUNNER, L.L.P. 



Dated: February 18, 2005 




Amy E. Purcell 
Reg. No. 53,492 
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Isolation and sequence characterization of a cDNA clone of human 
antithrombin III 

(natural anticoagulant/plasma protease inhibitor/human liver cDNA library/molecular cloning/DNA sequence) 

T. Chandra, Robin Stackhouse, Vincent J. Kidd, and Savio L. C. Woo* 

Howard Hughes Medical Institute, Department of Cell Biology, Baylor College of Medicine, Houston, Texas 77030 
Communicated by Earl W. Davie, December 29, 1982 



ABSTRACT A human liver cDNA library was constructed 
by using poly(A)-containing RNA isolated from a human liver bi- 
opsy specimen. This library is comprised of 40,000 independent 
transformants with an average inserted DNA length of 1,200 base 
pairs. By using the previously cloned baboon antithrombin III 
cDNA as a specific hybridization probe, >30 human antithrom- 
bin HI cDN A clones were identified from this library. The clone 
with the longest DNA insert was selected for sequence analysis. 
This antithrombin m cDNA clone contains 1,479 base pairs of 
inserted human DNA and was designated phATIII 113. It con- 
tains DNA sequences that code for a signal peptide and the en- 
tire mature antithrombin m protein which is comprised of 432 
amino acid residues. 

Antithrombin HI is a plasma protease inhibitor synthesized in 
the liver. The glycoprotein has a molecular weight of 55,000 
and its entire amino acid sequence has almost been completed 
(1). It is a natural anticoagulant in that it specifically inhibits 
a number of serine proteases that participate in the blood co- 
agulation cascade, including thrombin, factors IXa, Xa, XIa, 
and Xlla (2-5). The mechanism of inhibition involves the stoi- 
chiometric formation of protease-antiprotease complexes and 
the rate of complex formation is gready enhanced in the pres- 
ence of heparin, which is a well-known anticoagulant used 
clinically in myocardial infarction and surgery (6, 7). Defi- 
ciency of antithrombin III is a hereditary disorder that is as- 
sociated with recurrent thrombophlebitis, acute aortic throm- 
bosis, and thromboembolism (8-10). Heterogeneity of the 
classical antithrombin III deficiency has been observed (11). 
Abnormal antithrombin III also has been isolated from defi- 
cient patients and partially characterized (12), suggesting that 
the deficiency could be the result of mutations in the anti- 
thrombin III gene itself. Therefore, the genetic deficiency can 
be analyzed in molecular detail if the human antithrombin III 
gene can be isolated and characterized. We recendy have re- 
ported the purification of antithrombin III mRNA from a ba- 
boon liver by polysome immunoprecipitation and the cloning 
of its cDNA (13). In this paper, we report the construction of 
a human liver cDNA library and the identification and se- 
quence of a full-length human antithrombin III cDNA clone. 

MATERIALS AND METHODS 

Construction of a Human Liver cDNA Library. A human 
liver biopsy specimen was kindly provided by Bill Williams 
(University of Texas Medical School in Houston). Total nu- 
cleic acid was extracted from the frozen tissue with phenol 
(14), and poly(A)-containing RNA was prepared by oligo(dT)- 
cellulose column chromatography (15). The RNA preparation 

The publication costs of this article were defrayed in part by page charge 
payment. This article must therefore be hereby marked "advertise- 
ment" in accordance with 18 U. S. C. §1734 solely to indicate this fact. 



was used for cDNA synthesis under conditions that favored 
full-length cDNA production (16). The cDNA preparation was 
sedimented through an alkaline sucrose gradient (16) and only 
fractions containing cDNA species of 1,000 nucleotides or more 
were pooled. The single-stranded cDNA was subsequendy 
made double-stranded with reverse transcriptase (17). After 
nuclease SI treatment, tracts of poly(dC) were added to the 
3' termini of the DNA molecules by using terminal transfer- 
ase (18). The enzymatically synthesized cDNA was rehybrid- 
ized with Pst I-linearized pBR322 DNA that had been tailed 
with poly(dG) and the DNA was used for transformation of 
Escherichia coli RR1 (18). Bacterial transformants were se- 
lected for resistance to tetracycline and «40,000 individual 
recombinants containing human liver cDNA sequences were 
obtained. Analysis of the lengths of the inserted DNA in 20 
randomly selected colonies by minilysis (19) and electropho- 
resis has indicated that >95% of the transformants were in- 
deed recombinants, and the average length of human DNA 
in these recombinants was —1,200 base pairs. 

Identification of Human Antithrombin HI cDNA Clones 
from a Human Liver cDNA Library. The human liver cDNA 
library was screened by colony hybridization (20) by using a 
nick-translated Pst I fragment from a baboon antithrombin III 
cDNA clone (13). Recombinants containing human antithrom- 
bin III DNA sequences were identified and the lengths of 
DNA inserts in these clones were analyzed by both agarose 
and polyacrylamide gel electrophoreses. The clone with the 
largest DNA insert, designated phATIII 113, was chosen for 
DNA sequence analysis by the method of Maxam and Gilbert 
(21). 

RESULTS 

By using human liver po!y(A)-containing RNA and standard 
cloning procedures, a cDNA library comprised of 40,000 in- 
dependent transformants with an average inserted DNA length 
of 1,200 base pairs was constructed. Screening of the library 
with a baboon antithrombin III cDNA clone (13) under strin- 
gent hybridization conditions led to the identification of >30 
positive clones. Analysis of plasmid DNAs isolated from these 
human cDNA clones by a minilysis procedure and agarose gel 
electrophoresis showed that they contained common internal 
restriction fragments. The recombinant containing the largest 
DNA insert was selected for DNA sequence analysis. The clone, 
designated phATIII 113, contained about 1,500 base pairs of 
human DNA and was subjected to sequence analysis by the 
strategy shown in Fig. 1. Throughout most of the DNA mol- 
ecules, both DNA strands were subjected to sequence anal- 
ysis. When only one DNA strand was subjected to sequence 
analysis, at least two experiments were performed that in- 
volved the use of independent labeling sites. Finally, all la- 

*To whom correspondence should be addressed. 
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Fig. 1. Sequence analysis strategy of the human antithrombin III cDNA clone. The restriction sites used for labeling are shown, but they do 
not represent all such sites present on the human DNA. The 5' ends of DNA fragments were labeled with [^ 32 P]ATP employing polynucleotide 
T4 kinase. The FnuAH site at the 3 ' end was labeled with the appropriate [ot-^JdNTP by using the Klenow fragment of E. coli DNA polymerase. 



beling and overlapping sites were subjected to sequence anal- 
ysis to ensure the accuracy of the data. 

The inserted human DNA sequence was 1,479 nucleotides 
in length and was flanked by poly(dG) and poly(dC) of 7 and 
13 residues at the 5' and 3' termini, respectively (Fig. 2). 
Comparison of the DNA sequence with the known amino acid 
sequence of the protein (1) has shown that the codon for the 
amino-terminal histidine present in the mature plasma protein 
was located at nucleotides 97-99 in the cDNA clone. TTie rest 
of the amino acid sequence derived from the DNA sequence 
agrees perfectly with the published amino acid sequence of 
human antithrombin III (1). Finally, the amino acid sequence 
in a "gap" region of the protein (residues 214-221) has been 
determined to be Val-Leu-Val-Asn-Thr-Ile-Tyr-Phe from the 
DNA sequence. 

Hie amino acid residue immediately preceding the first 
amino acid of the mature protein was not a methionine. Be- 
cause antithrombin III is a secretory protein synthesized in 
the liver, the presence of a signal peptide at the NH 2 ter- 
minus of the protein could be expected for intracellular mem- 
brane transport (22). Indeed, the DNA sequence maintained 
an open reading frame at this region and the first methionine 
codon was located at the 5' terminus of the cDNA clone at 
position -32 (Fig. 2). If this methionine residue were the 
starting site, the signal peptide is 32 amino acids in length. 
However, another methionine residue farther upstream can- 
not be ruled out as the starring site for the signal peptide. 

In addition, DNA sequence analysis at the 3' terminus of 
this human antithrombin III clone has revealed a stretch of 
poly(dA) preceding the po!y(dC) residues. This indicates that 
phATIII 113 contains human DNA sequences that code for all 
amino acid residues in antithrombin III, most if not all of the 
leader sequence, and all of the 3' untranslated region of the 



mRNA. Like other eukaryotic mRNAs, it also contains the 
consensus hexanucleotide A-A-T-A-A-A sequence in close 
proximity to the 3' terminus which is necessary for polyade- 
nylylation of the RNA. 

DISCUSSION 

A human liver cDNA library has been screened by using a 
previously reported baboon antithrombin III cDNA clone (13) 
and >30 human antithrombin III cDNA clones were identi- 
fied. The inserted human DNA in the largest recombinant 
plasmid was subjected to sequence analysis in its entirety and 
its corresponding amino acid sequence has been deduced. From 
the nucleotide sequence, it is apparent that human antithrom- 
bin III, being a plasma protease inhibitor synthesized in the 
liver, contains a signal peptide of 31 or more amino acids which 
is involved in intracellular transport through the endoplasmic 
reticulum (22). 

Having cloned and characterized the human antithrombin 
III cDNA, it would be possible to analyze the familial anti- 
thrombin III deficiency by gene mapping, much in the same 
way as the cloned human or- and 0-globin genes were utilized 
in the analysis of various hemoglobinopathies. This type of 
analysis has led to the development of gene mapping meth- 
odologies for prenatal diagnosis of various thalassemias and 
sickle cell anemia by genetic polymorphism linkage to the he- 
reditary disorders (23, 24). Recently, our laboratory also has 
been able to use a cloned human phenylalanine hydroxylase 
cDNA probe to analyze classical phenylketonuria by restric- 
tion fragment-length polymorphism (unpublished data). Sub- 
sequently, methods for direct analysis of the point mutation 
in the sickle cell trait also have been developed, mainly by 
identification of restriction enzymes that could distinguish the 



Fig. 2 (on following page). The complete nucleotide sequence of the human DNA insert in phATIII 113 and the amino acid sequence deduced 
from the DNA sequence. The amino acid residues in the mature protein are numbered 1 through 432, and those in the putative signal peptide are 
numbered -1 through -31, with the initiation codon for methionine as number -32. The numbering of the nucleotide sequence starts with the 
ATG initiation codon for the putative pre-anti thrombin m protein. 
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-32 -20 

NET TYR SCR ASM VAL ILE GLY THR VAL THft SER GLY LYS ARG LYS VAL TYR LEU LEU SER 
(QjCA TQTATTCCAATQTGATAGQAACTGTAACCTCTGGAAAAAGGAAGG1TTATCTTTTGTCC 
r 10 20 30 40 50 60 

-XI t 

LEU LEU LEU ILE GLY PHC TRP ASP CVS VAL THR CYS HIS GLY SER PRO VAL ASP ILE CtS 
TTGCTGCTCA.TTGOCTTCTGGGACTGCOTGACCTGTCACGGAAGCCCTGTGGACATCTGC 
70 80 90 100 UO 120 

21 

THR ALA LYS PRO ARG ASP ILE PRO MET ASH PRO MET CYS ILE TYR ARG SER PRO GLU LYS 
ACAOCCAAGCCGCGGGACATTCCCATGAATC C.C ATGTGCAT TTACCGCTCCCCGGAGAAG 
130 140 ISO 160 170 180 

41. 

LYS ALA THR GLU ASP GLU GLY SER GLU GLH LYS ICE PRO GLU ALA TRR ASH ARG ARC VAL 
AAGGCAACTGAGGATOAGGGCTCAGAACAGAAGATCCCG.GAGGCCACCA.AC. CGGCGTGTC 
190 200 210 220 230 240 

€8 

TRP GLU LEU SER LYS ALA ASM SER ARG PHC ALA THR THR PKE TYR GLH HIS LEU ALA ASP 
TGOGAACTGTCCAAGGCCAATTCCCGCTTTGCTACCACT TT CTATC AGCACCTGGCAGAT 
250 260 270 280 290 300 

M 

SER LYS ASM ASP ASM ASP ASM ILE PHE LEU SER PRO LEU SER ILE SER THR ALA PHE ALA 
TCCAAGAATGACAATGATAACATTTTCCTGTCACCCCTGAGTATCTCCACGGCTTTTGCT 
310 320 330 340 330 360 

10t 

NET THR LYS LEU GLY ALA CYS ASM ASP THR LEU GLN GLH LEU MET GLU VAL PHE LYS PHE 
ATGACCAAGCTOGGTGCCTGTAATGACACCCTCCAGGAACT GATGGAGGTAT TTAAGT TT 
370 380 390 400 410 420 

121 

ASP THR ILE SCR GLU LYS THR SER ASP GLH ILE HIS PHE PKE PHE ALA LYS LEU ASM CYS 
GACACCATATCTGAGAAAACATCTGATCAGATCCACTTCTTCTTTGCCAAACTGAACTGC 
430 440 450 460 470 480 

141 

ARG LCU TYR ARG LYS ALA ASM LYS SCR SER LYS LEU VAL SER ALA ASM ARG LEU PHE GLY 
CGACTCTATCGAA. AAGCCAA.CAAATCCTCCAAGTTAGTGTCAGCCAATCGCCTTTTTOGA 
490 300 510 520 530 540 

1M 

ASP LYS SCR LEU THR PHE ASM GLU' THR TYR GLN ASP ILC SCR GLU LEU VAL TYR GLY ALA 
GACAAATCCCTTACCTTCAATGAGACCTACCAGGACATCAGTGAGTTGGTATATGGAGCC 
SSO 560 570* 580 590 600 

in 

LYS LEU GLN PRO LEU ASP PHE LYS GLU ASM ALA GLU GLN SER ARG ALA ALA ILE ASH LYS 
AAOCTCCAGCCCCTGGACTTCAAGGAAAATGCAGAGCAATCCAGAGCTGCCATCAACAAA 
610 620 630 640 650 660 

200 

TRP VAL SCR ASM LYS THR GLU GLY ARG ILC THR ASP VAL ILC PRO SER GLU ALA ILE ASN 
TGGGTGTCCAATAAGACCGAAGGCCGTATCACCGATGTCATTCCCTCGGAAGCCATCAAT 
670 680 690 700 710 720 

221 

QLU LEU THR VAL LEU VAL LEU VAL ASN THR ILE TYR PHE LYS GLY LEU TRP LYS SCR LYS 
GAGCTCACTGTTCTGGTGCTGGTTAACACCATTTACTTCAAGGGCCTGTGGAAGTCAAAG 
730 740 750 760 770 780 

241 

PMC 8CR PRO GLU ASM THR ARG LYS GLU LCU PME TYR LY8 ALA ASP GLY GLU SER CYS SER 
TTCAGCCCTGAGAACACAAGGAAGGAACTGTTCTACAAGGCTGATGGAGAGTCGTGTTCA 
790 800 810 820 830 840 

261 

ALA SCR MET NET TYR GLN GLU GLY LYS PHE ARG TYR ARG ARG VAL ALA GLU GLY THR GLN 
GCATCTATGATGTACCAAGAAGGCAAGTTCCGTTATCGGCGCGTGGCTGAAGGCACCCAG 
850 860 870 880 890 900 

281 

VAL LEU GLU LCU PRO PHC LYS GLY ASP ASP ILC THR MET VAL LEU ILE LEU PRO LYS PRO 
GTGCTTGAGTTGCCCTTCAAA 0 0 TGATGACATCACCATGGTCCTCATCTTGCCCAAGCCT 
910 920 930 940 950 960 

30t 

QLU LYS SER LEU ALA. LYS VAL GLU LYS GLU LEU THR PRO GLU VAL LEU GLH GLU TRP LEU 
GAGAA8A0CCTGGCCAAGGTAQAGAAGGAACTCACCCCAGAGGTGCTGCAAGAGTGGCT0 
970 980 990 1000 1010 1020 

328 

ASP GLU LEU GLU GLU MET MET LEU VAL VAL HIS NET PRO ARG PME ARG ILE GLU ASP GLY 
OATGAATTGOAGGAGATGATGCTGOTGGTCCACATGCCCCGCTTCCGCATTGAOGACGOC 
1030 1040 1050 1060 1070 1080 

348 

PMC SER LCU LYS GLU GLN LCU GLN ASP NCT GLY LCU VAL ASP LCU PHC SCR PRO GLU LYS 
TTCAGTTTGAAGGA0CAGCT6CAAGACATGGGCCTTGTCGATCTGTTCAGCCCTGAAAAG 
1090 1100 mo 1120 1130 1140 

361 

BCR LYS LCU PRO GLY ILC VAL ALA GLU GLY ARG ASP ASP LCU TYR VAL SCR ASP ALA PHC 
TCCAAACTCCCAGGTATTOTTGCAGAAGGCCGAGATGACCTCTATGTCTCAOATGCATTC 
1150 U60 1170 1180 1190 1200 

388 

HIS LV8 ALA PMC LCU GLU VAL ASM GLU GLU GLY SCR GLU ALA ALA ALA SCR THR ALA VAL 
CATAAGGCATTTCTTGAGGTAAATGAAOAAGGCAGTGAAGCAGCTGCAAGTACCGCTGTT 
1210 1220 1230 1240 1250 1260 

VAL ILE ALA GLY ARG SCR LCU ASM - PRO ASM ARG VAL . THR PHC LYS ALA ASN ARG PRO PHC 
0T0ATT0CT8GCCQTTC0CTAAACCCCAACAGGGTGACTTTCAAGGCCAACA0GCCTTTC 
1270 1280 1290 1300 1310 1320 

LCU VAL PMC ILC ARG GLU VAL PRO LEU ASN THR ILE ILE PHC MET GLY ARG VAL ALA a8« 
CTGGTTTTTATAAGAGAAOTTCCTCTGAACACTATTATCTTCATGOOCAOAGTAGCCAAC 
*330 1340 1350 1360 1370 1380 

432 

PRO CYS VAL LYB 8*8 

CCTTOTOTTAAOTAAAATOTTCTTATTCTTTGCACCTCTTCCTATTTTTOGTTTOTGAAC 
»390 1400 1410 1420 1430 1440 

A8AA0TAAAAATAAATACAAACTACTTCCATC TCACAT T U) d (do 
1450 1460 1470 M80 



FlG. 2. (Legend appears at the bottom of the preceding page.) 
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mutated nucleotide in the 0-globin gene (25, 26). In cases such 
as a r antitrypsin deficiency, in which die point mutation does 
not create or destroy a restriction recognition sequence, spe- 
cific oligonucleotides can be synthesized and used to distin- 
guish the normal and mutated genes in chromosomal DNA 
(unpublished data). Thus, the cloning, sequence analysis, and 
comparison of the normal and deficient antithrombin III cDNAs 
should permit the development of such methodologies for 
prenatal diagnosis of antithrombin III deficiency. Further- 
more, because phATUI 113 reported here apparently contains 
all of the peptide-coding sequences, it should be possible to 
use DNA engineering technology for production of this nat- 
ural anticoagulant for therapeutic application in deficient in- 
dividuals as well as patients with other coagulation compli- 
cations. 

Antithrombin III shares significant amino acid sequence 
homology with human ai-antitrypsin (1, 27, 28), which is the 
major plasma protease inhibitor that serves as the principal 
neutralizing factor for polymorphonuclear leukocyte elastase 
(29). Comparison of the nucleotide sequences between the 
human cDNA clones revealed a homology level of 46.5%. 
Further comparison of the two sequences by computer-as- 
sisted dot-matrix analysis (30) has not revealed any regions in 
the genes that are particularly homologous with each other 
and the sequence homology is spread rather evenly through- 
out the molecules. Surprisingly, the two plasma protease in- 
hibitors also share significant sequence homology with chicken 
ovalbumin (28), which is the major egg-white protein and has 
no apparent protease inhibitor activity. The three proteins have 
been classified as members of a superfamily that had diverged 
over 500 million years ago (31). We recently have reported the 
cloning and characterization of the human chromosomal a r 
antitrypsin gene (31). Comparison of its molecular structure 
with that of the chicken ovalbumin gene showed that the 
number, size, and positioning of the two sequence-related genes 
are completely different, suggesting that intronic sequences 
also could be inserted into preexisting exonic sequences if the 
two genes arose by divergent evolution (31). Because the ex- 
tent of amino acid sequence homology between antithrombin 
III and a r anti trypsin is greater than that between ^-anti- 
trypsin and chicken ovalbumin, it would be interesting to ex- 
amine the genomic organization of the antithrombin III gene 
and compare its molecular structure with those of the human 
ai-antitrypsin and chicken ovalbumin genes. These studies 
could lead to a better understanding of the evolutionary origin 
of this interesting gene family. 

The authors are indebted to Dr. Bill Williams of the University of 
Texas Medical School at Houston for providing us-with the human liver 
biopsy specimen for construction of the cDNA library. We also thank 
Dr. Charles Manner and Ms. Wanda Beattie for helpful discussion and 
Ms. Sharon Moore for excellent technical assistance. The work was par- 
tially supported by Grant HL-27509 from the National Institutes of 
Health, and S.L.C. W. is an Investigator of the Howard Hughes Medical 
Institute. 
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preparation via recombinant DNA technology, form the 
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Human ant i thrombin III.DNA sequences therefor, expression 
vehicles and cloning vect ors containing such sequences 
and cell cultu res transformed thereby, a process for 
expressing human antithro mbin III, and pharmaceutical 
compositions comprising it. 

Background of the Invention 

Antithrombin III (ATIII) plays a critical role in 
maintaining the fluidity of blood. Blood coagulation is 
mediated by a series of serine proteases. Antithrombin III is 
a potent inhibitor of Factors IXa. Xa (Kurachi et aK. 
1976) 1 , XI (Oamus et aK . 1973), XI la (Stead et al. f I976h 



Reference is made to the appended bibliography which is 
hereby made a part hereof, the publications and other materials 
there correspondingly cited in more detail being incorporated 
herein by reference. 
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and thrombin (Rosenberg et aj^., 1973). Thus, AT 1 1 1 regulates 
clot formation both at the activation level and the thrombin 
level. The physiological importance of AT III in preventing 
excessive coagulation is revealed by studies of individuals 
5 whose antithrombin levels are decreased due to heredity 
(Egeberg, 1965; Odegard £t aK , 1977 ; Van der Meer et^ aU , 
1973; and Sas et^ ajk , 1974) or acquired deficiency (Abildgaard 
et aK, 1970; Mannucci ejt a_K . 1973 ; Fagerhol et a]_. , 1970). 
Such persons are prone to spontaneous thrombosis and the 
10 associated risks of disseminated intravascular coagulation 

(0IC), cardiac infarction, cerebrovascular accident, pulmonary 
embolism, etc. Transfusion of patients suffering from severe 
bleeding disorders complicated by 0IC with antithrombin III 
concentrates obtained by blood fractionation has suggested that 
15 such replacement therapy can restore normal hemostatic function 
(Schipper et aK , 1978) . 

Human antithrombin III is a single-chain glycoprotein 
synthesized by the liver and found in plasma at a concentration 
of approximately 20 mg/dl (Col len et aU , 1977 ). Molecular 
20 weight determinations for antithrombin III have yielded values 
between 54,000 and 65,000 daltons (Rosenberg and Oamus, 1973; 
Nordenman et. aK , 1977 ; Kurachi et aj_. , 1976), of which some 10 
percent is carbohydrate (Kurachi et aU , 1976). The primary 
amino acid structure of human antithrombin III has been almost 
25 completely determined by Petersen et aU (1979) who reported 
that this protein has approximately 430 amino acid residues, 4 
glucosamine-based oligosaccharide side units, and 3 disulfide 
bridges. However, these workers were uncertain as to the 
precise identity of several amino acids, and moreover, did not 
identify the sequence and length of an amino acid stretch in 
30 the middle of the protein due to incompletely overlapped 
peptides. 
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ATIII inactivates thrombin by forming a covalent 1:1 
stoichiometric complex with the protease (Rosenberg and Damus, 
1973; Owen, 1975). The anticoagulant effect of antithrombin 
III is enhanced by heparin, which greatly increases the rate of 
inhibitor-protease complex formation (Abildgaard, 1968). 

It was thought possible that therapeutic administration of 
human antithrombin III produced by genetically engineered 
microorganisms would also be useful in the clinical prevention 
and management of thromboses. Such genetic engineering methods 
would provide sufficient quantities of material so as to enable 
its clinical testing as a demonstration of the safety and 
efficacy prerequisites to marketing. Therefore, the task was 
undertaken of cloning the gene for human antithrombin III and 
expressing it in a host cell. 

Summary of the Invention 

The present invention is directed to the means and methods 
of producing human antithrombin III via recombinant DNA 
technology, including 1) the discovery and identity of the 
entire DNA sequence of the mature protein as well as its signal 
polypeptide, and the 3'- and 5«-flanking regions thereof; 2) 
the construction of cloning and expression vehicles comprising 
said DNA sequence, enabling the amplification, and thence, 
expression of the mature human antithrombin III protein, as 
well as Met, fusion, or signal N-terminus conjugates thereof; 
and 3) viable cell cultures, genetically altered by virtue of 
their harboring such vehicles and capable of producing human 
antithrombin III polypeptide. Further, this invention provides 
human antithrombin III in physical state distinct from its 
existence in, or isolation from, a natural environment or 
source, it by virtue of its method of preparation herein, being 
essentially free of usual endogenous proteins and other native 
materials or substances. 
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This invention is directed to the recombinant OWA 
production of human an t i thromb i n III in all of its aspects, and 
is not to be construed as limited to any specific details 
described herein and embraced within the compass of this 
5 invention. For example, the term "mature" as used herein 
connotes human ATI 1 1 as well as methionyl as a first amino 
acid, present by virtue of the ATG transl ational codon in the 
expression vector construction hereof. 

10 Description of Preferred Embodiments 

The work described herein was performed employing, inter 
ajj^a, the microorganism £^ col i K-12 strain 294 (end A, thi~, 
hsr", k hsm + ), as described in British Patent Application 
Publication No. 2055382A. 
15 This strain has been deposited with the American Type 

Culture Collection, ATCC Accession No. 31446, on October 28, 
1978. However, various other microbial strains are useful, 
including known E. col i strains such as E. col i B, coli 
xl776 (ATCC No. 31537, deposited July 3, 1979) and E. col i 
20 W3110 (F~, x", protrophic) (ATCC No. 27325), or other 

microbial strains many of which are deposited and (potentially) 
available from recognized microorganism depository 
institutions, such as the American Type Culture Collection 
(ATCC)--cf. the ATCC catalogue listing. See German 
25 Offenlegungsschrif t 2644432. These other microorganisms • 
include, for example, Bacilli such as Bacillus subtilis and 
other enterobacteriaceae among which can be mentioned as 
examples Salmonella typhimurium and Serratia marcesans » 
utilizing plasmids that can replicate and express heterologous 
30 gene sequences therein. 

Expression plasmids for bacterial use, e.g., E. coli are 
commonly derived using pBR322 as vector and appropriately 
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inserting the heterologous gene sequence together with 
transl ational start and stop signals in operable reading phase 
with a functional promoter, taking advantage of common or 
synthetically created restriction sites. The vector will carry 
one or more phenotypic selection characteristic genes and an 
origin of replication to insure amplification within the host. 
Again, the heterologous insert can be aligned so as to be 
expressed together with a fused presequence, derivable for 
example from the trp system genes. 

The present invention may also employ various yeast 
strains, hosting compatible expression vectors, such as the 
plasmid YRp7 (see Stinchcomb et^ aK , Nature 282 , 39 (1979)), 
which is capable of selection and replication in both E. col i 
and yeast, particularly Saccharomyces cerevisiae . A useful 
strain is strain RH218 (Mioggari et a^., J. Bacteriology 134, 
48 (1978)) deposited with the American Type Culture Collection 
without restriction (ATCC No. 44076). 

To express a heterologous gene such as the cONA for human 
-anti thrombin III in yeast, it is necessary to construct a 
plasmid vector containing four components. The first component 
is the part which allows for transformation of both E. col i and 
yeast and thus must contain a selectable gene from each 
organism. This can be the gene for ampicillin resistance from 
£• coH and the gene TRP 1 from yeast. This component also 
requires an origin of replication from both organisms to be 
maintained as a plasmid DNA in both organisms. This can be the 
£• col i origin from pBR322 and the ars l origin from chromosome 
III of yeast or the origin of replication from 2y circle 0NA. 

The second component of the plasmid is a S'-flanking 
sequence from a highly expressed yeast gene to promote 
transcription of a downstream-placed structural gene. The 
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S'-flanking sequence can be that from the yeast 3-phospho- 
glycerate kinase (PGK) gene. The fragment is constructed in 
such a way so as to remove the ATG of the PGK structural 
sequence, replaced with a sequence containing alternative 
5 restriction sites, such as Xba l and Eco Rl restriction sites, 
for convenient attachment of this 5'-flanking sequence to the 
structural gene. 

The third component of the system is a structural gene 
constructed in such a manner that it contains both an ATG 
1G transl at ional start and tr ans 1 at i onal stop signals. 

The fourth component is a yeast DMA sequence containing the 
3'-flanking sequence of a yeast gene, which contains the proper 
signals for transcription termination and po 1 yadeny 1 a t i on . 
For example, plasmids directing the production of 
15 methionyl-ATI 1 1 and pre AT 1 1 1 in yeast can be constructed by 
respectively inserting gene fragments for the mature protein 
and mature protein plus signal peptide into the Eco RI site of 
8.6 kbp expression plasmid YEplPT. (Hitzeman et. dj_ t Proc. of 
Berkeley Workshop on Recent Advances in Yeast Molecular 
20 Biology , 20-22 May '82, University of California, Berkeley.) 
The YEplPT vector contains a pBR322 origin to allow bacterial 
replication as well as a yeast 2u origin. YEplPT also carries 
an ampicillin resistance marker as a selectable trait in 
£• col i and the yeast TRP1 gene which is used to maintain the 
2S plasmid in tryptophan auxotrophic yeast strains. A unique Eco 
RI cloning site is immediately downstream of a 1.6 kb yeast ON A 
fragment carrying the efficient promoter for the 
3-phosphoglycerate kinase (PGK) gene. The AT 1 1 1 structural 
gene fragments generated by complete Eco RI and partial Pst 1 
30 digestion of pATIII-E7 or pATI 1 1- J4 (see infra . ) can be 
inserted into the YEplPT EcoRI site adjacent to the PGK 
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promoter. A 247 bp Pstl-EcoRI ONA fragment derived from yeast 
2u circle DNA is useful to convert the Ps_tl sites at the 3' 
ends of the AT 1 1 1 inserts into EcoRI ends; this convertor 
fragment also provides 2» terminator function. 
5 Similarly, the present inventions can employ various cell 

culture systems with appropriate vectors. One useful host for 
the production of heterologous protein is the C0S-7 line of 
monkey kidney fibroblasts (Gluzman, Cell 23. 175 (1981)). 
However, the present invention could be practiced in any cell 
10 line that is capable of the replication and expression of a 
compatible vector, e.g., WI38, BHK, 3T3, CHO , VERO, and HeLa 
cell lines. Additionally, what is required of the expression 
vector is an origin of replication and a promoter located in 
front of the gene to be expressed, along with any necessary 
15 ribosome binding sites, RNA splice sites, polyadenylation site, 
and transcriptional terminator sequences. It will be 
understood that this invention, although described herein in 
terms of a preferred embodiment, should not be construed as 
limited to these sequences. For example, the origin of 
20 replication of SV40 and other viral (e.g., Polyoma. Adeno, VSV, 
BPV, and so forth) vectors could be used, as well as cellular 
origins of ONA replication which could function in a 
nonintegrated state. 

The strategy for the synthesis of heterologous peptide in 
25 mammalian cell culture relies on the development of a vector 
capable of expression of a foreign gene under the control of a 
transcriptional unit. The replication of this vector in tissue 
culture can be accomplished by providing a DNA replication 
origin (such as from SV40 virus), and providing helper function 
30 (T antigen) by the introduction of the vector into a cell line 
endogenously expressing this antigen (Lusky et aJL , Nature 293 , 
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79 (1981)). A promoter of SV40 virus precedes the structural 
gene and ensures the transcription of the gene. Alternatively, 
the expressed gene can be incorporated into the cellular genome 
via procedures known per se so as to ensure stable transmission 

5 of the gene. 

A useful vector to obtain expression consists of pBR322 
sequences which provides a selectable marker for selection in 
Z. col i (ampicillin resistance) as well as an E. col i origin of 
DNA replication. These sequences are derivable from the 

10 plasmid pML-1 (Lusky e^ a_K , Nature 293 , 79 ( 1981 )). The SV40 
origin is derivable from a 342 base pair Pvu l I - Hind i 1 1 fragment 
encompassing this region (Fiers et a± , Nature 273 , 113 ( 1978)) 
(both ends being convertable to Eco RI ends). These sequences, 
in addition to comprising the viral origin of DNA replication, 

15 encode the promoter for both the early and late transcriptional 
unit. The orientation of the SV40 origin region is such that 
the promoter for the early transcriptional unit is positioned 
proximal to the gene encoding human antithrombin III. 

20 Brief Description of the Drawings 

Figure 1 depicts antithrombin III mRNA and cDNA clones. 
The top line represents the mRNA coding for human antithrombin 
III. DNAs employed as primers and hybridization probes are 
indicated by bars and the letters a-d above the mRNA. a) pool 

2 * of eight N-14 mers, b) pool of eight i-16 mers, c) i-14mer- 
d) 230 bp Ode I fragment. The initial and final amino acid 
codons of the mature AT 1 1 1 polypeptide are indicated by the 
circled 1 and 432 respectively. Key restriction endonuclease 
sites are shown by vertical lines. An approximate size scale 

30 in nucleotides is included. 

Figure 2 shows the nucleotide and amino acid sequence of 
human antithrombin III. The nucleotide sequence of the human 
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ATIII mRNA was determined from ON A sequence analysis of the 
cDNA clones pA62 and pA68. Predicted amino acids of the signal 
and mature ATI II polypeptide are shown above the DNA sequence 
and are numbered from the first residue of the mature protein. 
5 The nucleotide sequence shown does not extend to the true 5 1 
terminus of AT 1 1 1 mRNA. 

Figure 3 depicts the construction of pATIII-E7, details of 
which are provided infra . (A) Synthesis of 45 bp EcjjR I -Hind 1 1 1 
fragment by DNA Pol I extension of overlapping synthetic 
10 oligonucleotides. (6) The synthetic fragment from (A), 

indicated by an open box, was ligated into the Ecjo R I - Hiji d I II 
sites of p8R322, whose genes for ampicillin (amp) and 
tetracycline (tet) resistance are shown by dark boxes. A 40 bp 
EcoRI-SacI fragment was isolated from pRlO and ligated to a 590 
15 bp fragment (shown by cross-hatched box) isolated from pA62. 
(C) EcoRI-SacI and SacI-PstI (dotted box) fragments were 
ligated into the trp expression vector fragment of LeIFA 
trp 103 (Gray et. aK , Nature 295 , 503 (1982)) to yield pTA2. 
(0) pTA2 is a tetracycline resistant plasmld containing AT 1 1 1 
20 structural gene sequences under control of the I. col i trj> 
operon. To correct a two nucleotide deletion found in the 
structural gene, the EcoRI-SacII fragment was replaced with 
fragments synthesized by different means. The resulting 
constructions, pATIII-E7 and pATIII-J4 f resulted respectively 
25 in synthesis of the human mature (with accommodation of 
methionyl first amino acid) AT 1 1 1 and human pre AT 1 1 1 
polypeptides in E. col i cells. 

Detailed Description 
30 Methods ; 

RNA preparation . Human liver messenger RNA was prepared by 
the guanidinium thiocyanate method (Ullrich et al., 1977) 
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followed by oligo dT cellulose chromatography (poly(A)+ RNA) 
or by phenol extraction of polysomes isolated in the presence 
of the potent ribonuclease inhibitor, hydroxyst i lbamid i ne 
isethionate (Lizardi, 1980) (polysomal RNA). To obtain 
approximately 3 mg of polysomal RNA, 5 g of liver were 
homogenized in 10 mM NaCl, 1.5 mM MgCl^, 1 percent NP40, 250 
mM sucrose, 10 mM Tris-HCl, pH 7.5, 1.5 mM hy droxys t i 1 b am i d i n e 
isethionate (Merrell, Cincinnati, OH). Nuclei and membranes 
were cleared from the nomogenate (750 xg, 5 m) , and the 
polysomes were spun thru 0.5 M sucrose in 0.3 M NaCl, 50 mM 
Tris-HCl, pH 7.5, 5 mM MgCl 2 , 1.5 mM hydroxys t i 1 b am i d i n e 
isethionate onto a pad of 1.0 M sucrose in the same buffer 
(12,000 xg, 20 m). The polysome band was phenol extracted and 
ethanol precipitated, resuspended in water and clarified by 
centrif ugation at 12,800 xg for 30 m. The resultant polysomal 
RNA preparations had A260/A280 ratios in the range 1.8 - 2.0. 

Poly A( + ) liver RNA was enriched for ATIII messages by 
centri f ugation through a linear sucrose density gradient (15 
percent - 30 percent sucrose in 0.1 M NaCl, 12.5 mM EOTA, 10 mM 
Tris, pH 7.5, 0.15 percent Sarkosyl, Beckman SW50.1, 47 Krpm, 4 
h, 20*C). Aliquots of the fractionated RNA were used to direct 
j_n vitro protein synthesis in a 35 S-methionine supplemented 
rabbit reticulocyte lysate (BRL , Bethesda, MD). AT I II 
mRNA-containing fractions were identified by immunoprec i pi tat ion 
of vn vi tro synthesized protein with rabbit antihuman ATIII 
antiserum (Atlantic Antibodies, New Brunswick, Maine) (Kessler, 
1976) and subsequent SOS gel electrophoresis (Laemmli, 1970). 

Synthesis, Cloning and Screening of cDN A . Reverse 
transcription reactions (containing from 0.1 - 5 w g RNA) were 
primed with the synthetic DNA fragments indicated in the text 
(approximately 200 ng) or oligo dT fl? . (Collaborative) 
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(approximately 1 ug). Double stranded cDWA was prepared* oligo 
dC-tailed, inserted into Ps t l cleaved G-tailed pBR322 and 
transformed into E. col i 294 as previously described (Lawn 
et aK, 1981) . 

32 

5 col j transf ormants were screened with Jfc P 

32 

5 '-end-l abeled synthetic oligonucleotides or P-l abe 1 ed DNA 
prepared by random priming of given restriction fragments as 
previously described in Lawn e_t aK (1981). The synthetic DNA 
fragments used as primers and probes were prepared by the 
10 phosphotriester method (Crea and Horn. 1980). Procedures for 
0NA preparation and restriction enzyme analysis have also been 
published (Lawn et al_. , 1981). 

Hybridization Selection of ATI 1 1 mRNA . Approximately 4 ug 
of a 230 bp DNA fragment from plasmid pA3 was coupled to 25 mg 
15 DBM-cel lulose using the procedure developed by Noyes and Stark 
(1975). This ON A cellulose was used to isolate ATI I I-specif i c 
mRNA from 1 mg of human liver polysomal RNA utilizing the 
conditions described in Bock et^ al_. (1982). 

Construction of the expression plasmid . Procedures for 0NA 
20 fragment isolation and conditions for the ligation reactions 
have been published elsewhere (Lawn et a_K , 1931) and are 
applicable herein. Synthesis of fragments connecting the 
promoter and 5' end of the gene are described below. 

Synthesis of Eco RI - Hind III fragment . The principles 
25 underlying the design of the 36-base (dCTAGAATTCTATGCACGGCTCG 
CCAGTGGACATCTG) and 37-base (dCGCAAGCTTCCGCGGCTTGGCTGTGCAGAT 
GTCCACT) oligonucleotides used to synthesize the 45 bp Eco RI - 
Hindi II fragment are presented in detail infra . The 36-mer and 
37-mer were phosphorylated at their 5 1 ends with T4 
30 polynucleotide kinase (P-L), and 2 ug of each oligonucleotide 
were annealed together for 10 min at 68*C, 12 m at 20*C and 
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12 m at 4*C in a volume of 40 ul. The annealing reaction was 
supplemented with deoxynucleotide triphosphates to 400 »M each 
and 5 units of E. coli 0NA polymerase Klenow fragment (BRt) 
(total volume * 50 ul). The polymerization reaction was 
5 incubated for 30 min at 20 # C and 1 h at 37*C. 8uffer for the 
polymerization reaction contained 50 mM NaCl, 6 mM Tris.HCl, pH 
7.5, 6 mM Mg Cl 2 , 5 mM DTT, and 100 ug/ml BSA; the annealing 
reaction contained these same components at 1.25 X strengtn. 
After polymerization, the reaction mixture was deprote ini zed by 
10 phenol and chloroform extraction and digested sequentially with 
Hindlll and EcoRI. The resultant 45 bp fragment was purified 
by electrophoresis in a 15 percent polyacryl amide gel. 

Synthesis of 40 bp EcoRI - SacII fragment . A 1400 bp PsU 
fragment containing the 5' end of the AT 1 1 1 structural gene was 
15 isolated from pA62. 25 ug of this fragment and 3 w g of 5' 
phosphorylated 36-mer (used previously for synthesis of the 
EcoRI- Hind l 1 1 fragment) were incubated in 33 pi H 2 0 at 100*C 
for 5 m, then quick frozen. This annealing reaction was thawed 
and adjusted to a final volume of 50 ul containing 20 mM KC1, 8 
20 mM MgCl 2 , 30 mM DTT, 20 mM Tris.HCl, pH 8.3, 500 uM each 

dGTP, dATP, dCTP and dTTP, and 25 units reverse transcriptase 
(BRL). The reaction was incubated for 10 m at 20*C and 50 m at 
42*C. After deprotei ni zation, the DNA was treated with 7 units 
£. col j DNA polymerase Klenow fragment (Boehringer) as 
25 described previously. Oigestion with Sacll and EcoRI followed 
Klenow treatment and produced the desired 40 bp fragment which 
was gel isolated. 

ONA sequencing . DNA sequence analysis was performed by the 
methods of Maxam and Gilbert (1980) and Sanger, Nicklen and 
30 Coulson (1977). 

Protein Analysis . Cell pellets containing 1 A 55Q unit 
were obtained at various stages during the fermentation of 
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ant i thromb in expression plasmids ATIII-E7 and pATIII-J4 
(transformed into £. col i strain W3110). Each sample was 
resuspended in buffer containing 40mM imidazole CI, pH 7.5, 2 
percent SDS, 10 percent glycerol and 5 percent BME, and boiled 
5 for 3m, Samples were subjected to electrophoresis in the gels 
systems of Laemmli (1970) or Weber and Osborn (1969). Total 
protein was visualized by Coomassie Brilliant Blue staining and 
bacterially synthesized AT III was identified by Western blot 
analysis (Renart et a^, 1975). 

10 

Resul ts : 

Initial cONA clone for human antithrombin III . A 
recombinant plasmid containing 600 bp of the AT 1 1 1 structural 
gene was identified in a cDNA bank prepared from size 
15 fractionated RNA by hybridization to synthetic ONA fragment 
probes . 

The published partial protein sequence of human 
antithrombin III (Petersen e_t aK , 1979) was inspected for 
regions with limited codon degeneracy* Two such regions were 
20 identified and the corresponding synthetic DNAs of 14 or 16 
nucleotides in length were prepared (Table I). 
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TABLE I 



Am i n o 
Acid 

Posit ion* 



Protein 
Sequence 



Synthetic 
DNA Sequence 



Use 



S 



17 



Me tAsnProMe tCys 



G A 

5' dCACATAGGGTTCAT 3 1 



hybridization probe 



T 
C 



243 



T T A 

MetMetTyrGlnGluGly 5' dCCCTCCTGGTACATCA 3' hybridization 

probe; cONA primer 



10 



Synthetic DNA fragments used for cloning 
human ant i thromb in III 



♦Numbering system of Petersen £t aK , 1979. 

The more C-terminally located of these sequences, a' pool of 

15 eight 16 mers, was used to prime cDNA synthesis from poly(A)+ 
RNA which had been enriched for AT III mR.NA. Previous in vi tro 
translation studies have suggested that AT 1 1 1 mRNA constitutes 
only 0.5 percent of bovine liver poly(A)+ RNA (MacGi 1 1 i vray 
et_ aj_., 1979 ). We therefore enriched for AT 1 1 1 encoding RNA by 

20 sucrose density gradient cen tr i f uga t i on , and identified the 
desired fraction by i mmunoprec i pi ta t i on of AT III from in vitro 
translation assays. Approximately 250 tr ansf orman ts were 
obtained upon insertion of the oligo dC-ta.iled, double stranded 
cDNA produced by specifically primed reverse transcription of 

25 the size fractionated RNA into Ps_tl cleaved, oligo dG-tailed 
p BR3 22 . Colony hybridization of this clone bank to the 32 P 
5 '-end-1 abeled 14- and 16-base synthetic nucleotides (Table I) 
revealed one transformant with strong hybridization to the 
16mer, but not to the 14mer. DNA from this plasmid, designated 

3 0 pA3, was subjected to sequence analysis, and its nucleotide 
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sequence was shown to correspond to the protein sequence of 
human antithrombin III extending from amino acid 239 towards 
the C terminus for approximately 200 residues (Fig. 1). 

The identification of a clone encoding a portion of AT 1 1 1 
5 3» to amino acid residue 243 was unexpected since the cDNA from 
which this clone bank was constructed had been primed with a 
pool of 16-base oligonucleotides predicted to encode the region 
around amino acid 243. Transcripts from the complementary 
primer were therefore expected to extend in the 5' direction 
10 from amino acid 243 and should have not included sequences 3' 
to the probe. In actual fact, pA3 begins near to the 
C-terminus of the mature protein and extends for only 12 
nucleotides 5 1 to the probe sequence, thus hybridizing with the 
16mer pools, but not the 14mers (which are located in a more 
i5 N-terminal region of the protein). This particular cDNA clone 
perhaps resulted from the self-priming of AT III RNA species for 
the reverse transcription reaction. 

cDNA Clones for the Complete AT III Structural Gene . An 
internal restriction fragment from pA3 was used to purify the 
20 antithrombin III messenger RNA subsequently used for generating 
an oligo-dT primed cDNA clone bank. Two overlapping cDNA 
clones which together encode the entire human AT 1 1 1 structural 
gene and its associated 5* and 3* untranslated regions were 
isolated from this bank. 
25 0NA cellulose was prepared from the 230 bp Dde l fragment of 

pA3 and used to select complementary sequences from 1 mg of 
human polysomal liver RNA as outlined supra . In vitro 
translated protein from an aliquot of the bound RNA was 
considerably enriched for antithrombin III. The hybrid- 
30 selected RNA was used to direct oligo-dT primed cDNA 

synthesis. Double stranded cDNA was again cloned into the Pst I 
site of pBR322 through homopolymer i c dGC tailing, this time 
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yielding about 500 transf ormants . Quadruplicate filters of the 
clone bank were made and screened with 4 probes distributed 
along the length of the AT 1 1 1 structural gene (Fig. 1). The 
four hybridization probes were: (a) the pool of eight 

5 N-termina1 14 mers (N-14 mers) synthesized for the initial 
screening experiment, (b) the pool of eight 16 mers also used 
in the initial screening (i-16 mers), (c) a 14-base synthetic 
0NA (i-14mer, dTGAGGACCATGGTG) whose sequence had been 
determined by sequencing of pA3 and was complementary to the 

*0 coding strand at amino acid 271-276 (numbering scheme as per 
Petersen e_t a]_. , 1979), and (d) the 230 bp Ode I fragment from 
pA3 (which had also been used for mRNA hybridization 
selection) . 

28 of the 500 colonies generated by oligo dT priming of 

15 hybridization-selected RNA were positive when screened with the 
Dde l fragment used for RNA enrichment. The length of AT 1 1 1 
cDNAs inserted in these transf ormants was assessed by 
hybridization to the three more N-terminal probes: 20 were 
positive with the i — 14 mer probe, 17 with the pool of 16 mers, 

20 and 3 with the N-14 mers. The latter 3 plasmids had inserts of 
about 1.8 kb and were analyzed further. Partial DNA sequencing 
of these plasmids verified that they did indeed encode the 
amino terminus of mature human antithrombin III. 

Subsequent restriction mapping and DNA sequencing of one 

25 plasmid, pA62, revealed the primary structure of an AT III mRNA 
composed of an extended 5* untranslated region and a coding 
region of 1390 bps. pA62, however, was incomplete at the 3 1 
end of the gene. The homopol ymer i c dGC tails began after the 
second nucleotide of the codon for the carboxyl terminal amino 

30 acid residue. Thus, this plasmid was missing the last 

nucleotide of the C-terminal codon, the stop codon and the 3' 
untranslated region. 
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Further analysis of partial length AT 1 1 1 cONA clones in the 
library constructed from oligo dT primed, hybridization-enriched 
mRNA revealed a plasmid, pA68, which contained 400 bps from the 
3' end of the antithrombin III structural gene, an 84 bp 3' 
5 untranslated region, and a poly(A) tail. Sequence analysis of 
pA68 and the overlapping clone, pA62, has provided the complete 
primary structure of the human antithrombin III gene and 
portions of its flanking, untranslated regions. In addition, 
common restriction sites within the overlapping regions of the 
10 partial cONA clones have been utilized to construct a full 
length structural gene which has been inserted into an 
expression plasmid so as to direct the synthesis of human 
antithrombin III under trp operator-promoter control in E. col i . 
Sequence and Structure of the human antithrombin III 
15 messenger RNA . Nucleotide sequence analysis of the A62 and A68 
cDNA clones reveals that AT 1 1 1 mRNA has a 96 nucleotide signal 
sequence (32 amino acids), a 1296 nucleotide sequence encoding 
the mature protein (432 amino acids), and 84 residues in the 3' 
untranslated region. 
20 DNA sequencing reveals that a signal peptide of 32 residues 
probably precedes the mature N terminus of human antithrombin 
III. This sequence displays two features commonly attributed 
to signal peptide regions (Jackson and Blobel, 1980). First, a 
very hydrophobic region (LeuLeuSerLeuLeuLeuI 1 e) is observed 
25 about ten residues prior to the cleavage site. Second, the 
presence of a cysteine immediately prior to the mature N 
terminus is in agreement with the precedent that the amino acid 
residue immediately preceding the cleavage site has in all 
cases been one of the smallest amino acids. 
30 The mature form of human antithrombin III contains 432 
amino acid residues. The protein sequence determined by DNA 
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analysis agrees perfectly with the partial sequence published 
by Petersen ej. a_K ( 1979) and has provided definitive sequence 
information for the regions where the original amino acid 
determinations were ambivalent or unknown. Amino acid residues 

5 100, 101, 326 and 328 (numbering scheme as per Petersen) which 
were identified only as Glx by protein sequencing have been 
unambiguously determined as gin, gin, gin, and gin. Residue 
number 329 is an asp. ON A sequencing has also revealed that 
eight additional amino acids { Va 1 LeuVa 1 AsnTh r 1 1 eTyrPhe) occur 

10 between Leu 213 and Lys 214 of the sequence, which Petersen 
e t al . (1979) did not appreciate. 

Finally, the nucleotide sequence of AT III cONA indicates 
that the carboxy terminus of this protein is not processed, 
since the C-terminal residue of the mature protein, lysine, is 

15 immediately followed by a stop codon. The 3' noncoding region 
consists of 84 nucleotides following the termination codon 
UAA. The sequence AATAAA is present at minus 30 residues from 
the polyadenyl ati on site. 

20 Bacterial Expression of Human Ant i throtnb i n III 

pATIII-E7, a 7 kb plasmid producing 50,000 dalton 
meth iony l-ATI I I , was constructed by placing segments of partial 
cDNA clones pA62 and pA68 behind the Z. col i trp promotor (Fig. 
3). Construction of this expression plasmid necessitated the 

2 5 

synthesis of a DMA fragment incorporating an Eco R I cleavage 
site, 'an ATG translation initiation codon and the nucleotide 
sequence encoding the amino terminus of the mature structural 
gene, through the first unique restriction site. Sac 1 1 . We 
attempted to synthesize such a fragment by a combination of 
30 chemical and enzymatic procedures. A 36 base deoxyoligo- 

nucleotide was synthesized which contained 3 arbitrarily chosen 
nucleotides followed by the six base Eco RI recognition 
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sequence, then TATG and the 23 nucleotides encoding the first 
eight codons of mature human antithrombin III. 

This synthetic 36-mer was designed to provide optimal 
spacing between the ribosome binding sequence and the start of 
5 translation, as well as to minimize secondary structure in the 
5' end of the gene. To accomplish this, the triplets selected 
to encode amino acid residues 2, 3 and 4 do not correspond to 
the natural codons observed in the pA62 cDNA clone. A 37 base 
deoxyoligonucleotide which anneals to the 3' end of the 36 mer 
10 by 12 base pairs and extends towards the 3* terminus of the 
gene was also synthesized. Reading from its 3' to 5' end, the 
37 mer contains nucleotides coding for amino acid residues 4 
through 13 (containing a Sac 1 1 recognition sequence) and an 
artificial Hind i 1 1 site to be employed in propagating the 
15 linker fragment. The 36- and 37 mers were annealed, and the 
initial 12 bp long double stranded region was extended wi th DNA 
polymerase Klenow fragment. The duplex product was trimmed 
with EcoRI and Hind i II and the purified EcoRI to Hindi 1 1 
fragment was inserted into pBR322. DNA sequence analysis of 
20 the insert from this construction, pRIO, revealed that the 

synthetic DNA was missing 2 adjacent nucleotides of the desired 
sequence. 

The deletion may result from improper extension of this 
particular set of synthetic ONAs by the DNA polymerase 
25 employed. A method for correcting the 2-base deletion is 
discussed below. Before this was developed, the fragment 
containing the deletion was utilized as a convenient EcoRI to 
SacII linker for constructing a direct expression plasmid. 
The EcoRI to SacII fragment from pRIO was used to join 
30 segments of the pA62 and pA68 partial cDNA clones to the trp 

promoter in pBR322. Approximately 120 ng of the EcoRI to SacII 
fragment from pRIO was ligated to about 1 „g of the 590 bp 
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SacI I to SjkI fragment from pA62. A 630 bp EcoRI to Sac l ON A 
fragment was gel isolated following restriction of the ligation 
mixture with the appropriate enzymes. This Eco RI to Sac l 
fragment was joined to the 785 bp 0NA fragment produced by Sac l 

5 complete and Pst I partial digestion of pA68, and a pBR322 
expression vector having Eco RI and Pst I ends in a three part 
ligation reaction. The expression vector contains a trp 
promoter-operator fragment which is inserted at the Eco RI site 
of pBR322 and reads towards the amp gene. An Eco RI sequence 

10 follows the promotor ribosome binding site; and the Pst I site 
in the amp gene is used to fuse the vector to the 3* end of the 
structural gene. The product of this 3-part ligation is 
designated pTA2. 

A derivative pTA2 was made by substituting its 38 bp Eco RI 

15 to Sac II fragment with a 40 bp piece of synthetic DNA which 
contains an Eco RI site, initiation codon and intact coding 
sequence for amino acid residues 1 through 12 of AT III. This 
40 bp DNA fragment was synthesized by a process which utilized 
two different DNA polymerase enzymes. First, a 36 base long 

20 ; 

primer was hybridized to a template DNA fragment from pA62 and 
extended using the polymerase activity of AMV reverse 
transcriptase. (See supra , for a description of 36-mer.) 
Following primer elongation, the dual 3' to 5' exonuclease and 
5' to 3' polymerase activities of £. col i Klenow fragment were 
exploited to trim off excess single stranded template DNA and 
polymerize the first several codons of the structural gene, the 
translation initiation codon and the Eco RI tail. The prepara- 
tion was subsequently cut with Eco RI and Sac II and the 40 bp 
fragment was inserted into pTA2. pATIII-E7, the resultant 
30 plasmid, was verified to have the correct coding information 
between the Eco RI and Sac l I sites by DNA sequence analysis. 
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Furthermore, pAT 1 1 1 -E7 produces a 50,000 dalton protein under 
trp control which has been immunologically identified as 
antithrombin III. 

Additionally, pATIII-J4, a plasmid expressing 

5 pre-anti thrombin III under trp operator-promoter control has 
been constructed. pATIH-J4 was derived from pTA2 by 
replacement of the 38 bp Eco RI - Sac II fragment with a 145 bp 
Eco RI - Sac.II fragment encoding the presumptive 36 residue 
signal sequence of antithrombin III and the first 12 aminoacids 

10 of the mature polypeptide. This fragment was synthesized from 
a 24-base synthetic 0NA primer (dCTAGAATTCATGTATTCCAATGTG) and 
the 1400 bp pA62 Pst I fragment by sequential DNA polymerase I, 
Sac II and Eco RI treatments (Lawn e_t aU 1981). PATIII-J4 
expresses two proteins under trp control which react with ATI 1 1 

15 antisera. The major component has an apparent molecular weight 
approximating the calculated molecular weight of 
pre-anti thrombin III. A minor band was present that comigates 
with the ATI 1 1 band produced from pAT 1 1 1-E7 - 

20 inactivation of human thrombin by bacterially synthesized 
antithrombin III. 

ATI 1 1 inactivates thrombin by reacting with it to for'm a 
covalent, 1:1 stoichiometric complex (Jesty, 1979). For 
example, cell extract of E. coli W3110 transformed with 

25 pAT 1 1 1- J4 can be incubated with human thrombin in the presence 
of heparin. Following polyacryl amide gel electrophoresis, 
AT 1 1 1 thrombin complexes are detected by Western blotting 
(Renart, et aK , 1979) with rabbit anti-human AT 1 1 1 antiserum. 
The bacterial ATIII-human thrombin complex is observed in 

30 reactions to which thrombin has been added. 
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Pharmaceut i cal Comoos i t ions 

The compounds of the present invention can be formulated 
according to known methods to prepare pharmaceutical!/ useful 
compositions, whereby the human antithrombin III product hereof 

5 is combined in admixture with a pharmaceu t i cal 1 y acceptable 
carrier vehicle- Suitable vehicles and their formulation, 
inclusive of other human proteins, e.g. human serum albumin or 
plasma preparations, are described for example in Remi ngton 1 s 
Pharmaceutical Sciences Mark Publishing Co., Easton, Pa., 15th 

1° 2nd.,* 1975 , which is hereby incorporated by reference. Such 
compositions will contain an effective amount of the protein 
hereof together with a suitable amount of vehicle in order to 
prepare pharmaceutical ly acceptable compositions suitable for 
effective administration, preferably parenteral, to the host. 

15 Notwithstanding that reference has been made to particular 
preferred embodiments, it will be further understood that the 
present invention is not to be construed as limited to such, 
rather to the lawful scope of the appended claims. 
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Claims : 

1. Human antithrorobin III as a product of genetically altered 
cell culture. 

S 

2. Human anti thrombin III essentially free of native 
endogenous substances. 

3. Human antithrombin III according to Claim 1 unaccompanied 
10 by associated native glycosylation. 

4 # A 0NA sequence comprising a sequence encoding human 
antithrombin III. 

15 5. The DNA sequence according to Claim 4 including a sequence 
encoding the presequence peptide for human antithrombin 
III. 

6. A replicable cloning vehicle containing the 0NA sequence 
20 according to Claims 4 or 5. 

7. An expression vector comprising a DNA sequence encoding 
human antithrombin III operably linked to expression 
effecting DNA sequence and flanked by transl ational start 

25 and stop signals. 

8. A viable cell culture transfected with the expression 
vector of Claim 7. 

30 g. A cell culture of Claim 8 capable of producing mature 
human antithrombin III. 
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10. Plasmids pATI I I-E7 and pATIII-J4. 

11. A cell culture transfected with each of the plasmids 
according to Claim 10. 

5 

12. A composition comprising a therapeutically effective 
amount of human antithrombin III according to Claim 1 in 
admixture with a pharmaceutical ly acceptable carrier. 

10 

13. A culture of cells altered so as to direct the genetic 
expression of human antithrombin III. 

14. Human antithrombin III according to Claim 1 for use in 
IS treating cardiovascular diseases or conditions 

or in pharmaceutical compositions useful for such 
treatment . 

20 15. A process which comprises expressing a gene encoding human 
antithrombin III in a cell culture transfected with an 
expression vector operably harboring said gene. 
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